Entry Name:  SJTU-Dong-MC2

VAST Challenge 2015
Mini-Challenge 2

 

 

Team Members:
Shanghai JiaoTong University

Xiaoju Dong, xjdong@sjtu.edu.cn   PRIMARY

Peicang Guo, gpc94@sina.com

Luning Wang, 1171827676@qq.com

Xin Fan, 527309993@qq.com

 

Student Team:  Yes

Did you use data from both mini-challenges? No

 

Analytic Tools Used:

Excel

D3

Gephi

Approximately how many hours were spent working on this submission in total?

50 hours.

May we post your submission in the Visual Analytics Benchmark Repository after VAST Challenge 2015 is complete? Yes

 

 

Video:

 SJTU-Dong-MC2-video.wmv

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Questions

 

MC2.1 – Identify those IDs that stand out for their large volumes of communication.  For each of these IDs

 

a. Characterize the communication patterns you see.

b. Based on these patterns, what do you hypothesize about these IDs?

 

Limit your response to no more than 4 images and 300 words

 

a.

       The IDs that stand out for their large communication volume are as follows. The first two have extremely large volumes, and the third is representative of several similar IDs.

       The radius of each circle indicates that ID's communication traffic. If two IDs have communicated, their circles are linked by a line. The distance between two circles is negatively related to the number of communications between the IDs they represent.

Figure 1-1

       Figure 1-1 shows the pattern of ID1278894 (the blue circle), which has the largest communication volume. Its pattern is radiation. The black circle, also near the center, represents ID839736.

Figure 1-2

       Figure 1-2 shows ID839736, which has the second-largest volume. Its communication pattern is also radiation.

       Apart from these two IDs, whose communication volumes are unusually large, several IDs communicate relatively often. We choose ID195725 to represent them.

Figure 1-3

       Any two circles in the large cluster on the left of the graph, including ID195725, have a record of contact, so this part forms a complete graph. In addition, ID195725 has a large number of communications with other IDs.

 

b.

Figure 1-4

       The locations of ID1278894 and ID839736 over the three days are shown above; both remain in the same place the whole time.

       Combining this with the graphs from part (a), we see that ID839736 made 2526 calls and received 2503 calls, with fewer than 10 communication records with any single ID. It may therefore be a service desk staffed by a person who sends and receives messages.

       ID1278894 is connected to all groups and individuals, but only by sending messages, so it may be a broadcast center.

       ID195725 acts as a group leader, connecting with every member of its own group and with some other IDs, including ID1278894 and ID839736.
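
       The reasoning above (totals sent and received, the largest count with any single partner, and send-only behaviour) can be sketched in a few lines of Python. This is only an illustration, not the workflow used for this entry, which relied on Excel, D3 and Gephi. The function names, the thresholds and the toy records are all hypothetical.

```python
from collections import Counter, defaultdict


def summarize(records):
    """Per-ID totals from (sender, receiver) pairs, one pair per message."""
    sent, received = Counter(), Counter()
    per_partner = defaultdict(Counter)  # id -> partner -> number of messages
    for sender, receiver in records:
        sent[sender] += 1
        received[receiver] += 1
        per_partner[sender][receiver] += 1
        per_partner[receiver][sender] += 1
    return {
        i: {
            "sent": sent[i],
            "received": received[i],
            "partners": len(per_partner[i]),
            "max_per_partner": max(per_partner[i].values()),
        }
        for i in per_partner
    }


def guess_role(stats, min_partners=3, per_partner_cap=10):
    """Rough label; both thresholds are illustrative assumptions."""
    if stats["partners"] < min_partners:
        return "ordinary"
    if stats["received"] == 0:
        return "broadcast-like"      # many partners, sends only
    if stats["sent"] > 0 and stats["max_per_partner"] < per_partner_cap:
        return "service-desk-like"   # many partners, few messages with each
    return "other"


# Toy records with made-up IDs (not taken from the challenge data).
records = [
    ("B", "u1"), ("B", "u2"), ("B", "u3"),
    ("D", "u1"), ("u1", "D"),
    ("D", "u2"), ("u2", "D"),
    ("D", "u3"), ("u3", "D"),
]
summary = summarize(records)
roles = {i: guess_role(s) for i, s in summary.items()}
```

       On the real data the thresholds would have to be much larger; the cap of 10 messages per partner simply mirrors the observation made about ID839736.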

 

MC2.2 – Describe up to 10 communications patterns in the data. Characterize who is communicating, with whom, when and where. If you have more than 10 patterns to report, please prioritize those patterns that are most likely to relate to the crime.

 

Limit your response to no more than 10 images and 1000 words.

Pattern 1

Figure 2-1 angle

       This figure shows the angle pattern. In each group above, one person called the other two. We take one group as an example to illustrate who called whom and where.

       1235058 made calls to 880619 and 487752 at Tundra Land.

Pattern 2

Figure 2-2 radiation

       This figure shows the radiation pattern. One ID is the nucleus of the group and calls out to all the others. In this picture, 839736 (probably the person at the service desk) communicated with tourists at Entry Corridor.

Pattern 3

Figure 2-3 Y scale

       This figure shows the Y scale pattern, which is named after its shape. In this picture, 1531050 called 52462, 295951 and 1753644 at Tundra Land.

Pattern 4

Figure 2-4 double nuclear

       This figure shows the pattern of double nuclear communication. The group has two leaders, both connecting to all the others. The leaders may or may not be connected to each other. 1563610 and 195664 acted as group leaders and communicated with ID1214689, ID669199, ID399712, ID161071, ID429192 and several other IDs at Wet Land.

Pattern 5

Figure 2-5

       This figure shows the pattern of multi-nuclear communication. The group has more than two leaders, each connecting to all the others. We choose this group as a typical example. 1688395 (red, at Entry Corridor), 1581087 (red, at Entry Corridor), 445493 (green, at Wet Land) and 19249 (pink, at Coaster Alley) communicate with members at Entry Corridor.

Pattern 6

Figure 2-6 complete graph

       This figure shows the complete graph pattern, which means that nearly everyone in the group is connected to all the others. The communications above happened at Tundra Land.
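
       One way to make "nearly everyone is connected to all the others" measurable is the share of member pairs that have at least one record of contact; a value of 1.0 means a complete graph. The sketch below is an assumption about how such a check could be written, not the method used for the figure, and the function name and toy members are hypothetical.

```python
from itertools import combinations


def contact_density(members, contacts):
    """Fraction of member pairs with at least one contact, in either direction."""
    linked_pairs = {frozenset(pair) for pair in contacts if pair[0] != pair[1]}
    possible = list(combinations(sorted(set(members)), 2))
    if not possible:
        return 0.0
    linked = sum(1 for a, b in possible if frozenset((a, b)) in linked_pairs)
    return linked / len(possible)


# Toy group of four made-up members.
members = [1, 2, 3, 4]
all_pairs = [(1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4)]
full = contact_density(members, all_pairs)          # every pair in contact
partial = contact_density(members, all_pairs[:-1])  # one pair missing
```

       A group could then be called "nearly complete" when the density exceeds some chosen cut-off; the cut-off itself would be a judgment call.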

Pattern 7

Figure 2-7 linear

       This figure shows the pattern of linear communication. In a linear pattern, one person calls another, who in turn calls yet another. This forms a directed line and emphasizes hierarchy, which is the main difference between it and the angle pattern.
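
       The difference between the angle and linear patterns comes down to call direction. A minimal sketch for a group of three IDs joined by two calls follows; the function name is hypothetical, the angle example reuses the IDs given under Pattern 1, and the linear example uses made-up IDs.

```python
def classify_two_calls(calls):
    """Label two directed calls among three IDs as 'angle' or 'linear'."""
    calls = list(set(calls))           # drop repeated calls
    if len(calls) != 2:
        return "other"
    (a, b), (c, d) = calls
    if len({a, b, c, d}) != 3:         # the two calls must share exactly one ID
        return "other"
    if a == c:
        return "angle"                 # one caller reaches two different people
    if b == c or d == a:
        return "linear"                # the callee of one call is the caller of the other
    return "other"                     # two callers reaching the same callee


# Angle: 1235058 called both 880619 and 487752 (IDs from Pattern 1).
angle = classify_two_calls([(1235058, 880619), (1235058, 487752)])

# Linear: made-up IDs, A calls B and B calls C.
linear = classify_two_calls([("A", "B"), ("B", "C")])
```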

Pattern 8

Figure 2-8 mixed pattern

       This figure shows a mixed pattern of radiation and complete graph. Over a long period of time, communication patterns often become tangled, and we use this mixed pattern as an example. The center circle is ID1742503 at Wet Land, which communicated with a group of IDs such as ID229760, ID1570276, ID771453 and ID2085681. It also communicated with some active IDs like ID2022453, ID195725, ID515929, and ID869116 at Wet Land.

 

MC2.3 – From this data, can you hypothesize when the crime was discovered?  Describe your rationale.

 

       Limit your response to no more than 3 images and 300 words. 

Figure 3-1

The X-axis indicates time and the Y-axis indicates the total communication volume. The blue line is Friday, the green one is Saturday and the grey one is Sunday.

From the figure we can see that the main difference between Sunday and the other two days appears between X-axis values 210 and 260. The first peak may indicate the occurrence of the crime and the second peak its discovery. We therefore conclude that the crime was discovered at about X-axis value 240, which corresponds to 12:00.
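
The text gives only one anchor between the X-axis and clock time (240 corresponds to 12:00), so the axis scale has to be assumed. The sketch below converts an X-axis value to a clock time under two assumptions that both reproduce that anchor: 3-minute bins counted from midnight, or 1-minute bins counted from 08:00. Neither scale is stated in this entry, and the function and parameter names are hypothetical.

```python
def bin_to_clock(index, start_minutes=0, minutes_per_bin=3):
    """Convert an X-axis bin index to an HH:MM string under an assumed scale."""
    total = start_minutes + index * minutes_per_bin
    return f"{(total // 60) % 24:02d}:{total % 60:02d}"


# Assumption 1: 3-minute bins starting at 00:00.
midnight_scale = [bin_to_clock(i) for i in (210, 240, 260)]

# Assumption 2: 1-minute bins starting at 08:00 (480 minutes after midnight).
morning_scale = [bin_to_clock(i, start_minutes=480, minutes_per_bin=1)
                 for i in (210, 240, 260)]
```

Both scales place 240 at 12:00, but they disagree on where the 210 to 260 window falls, so the window boundaries should be read against the figure's actual axis.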